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© The invention is characterized as a data pro- 
cessing architecture and method for multi-stage pro- 
cessing of mail, using knowledge based techniques. 
The system includes OCR-scanning a multipart ad- 
dress field of a mail piece at a sending location, the 
address field including at least two portions, a first 
stage routing portion (destination city, state, country, 
zip code) and a second stage routing portion 
(destination street address, building floor, corporate 
addressee internal routing). 

At the sending location, the image of the entire 
address field is captured by an OCR head and 
stored in memory, A serial number is printed on the 
mail piece. The first routing portion is then converted 
into sorting signals to sort the mail piece to a truck 
at the sending location which is to be dispatched to 
the city, state and country indicated in the first stage 
routing portion. 

Then, while the mail piece is in transit by truck 
to the destination city, the image of the second 
stage routing portion is analyzed by a knowledge 



base processor to resolve street addresses, building 
floor, corporate addressee internal routing informa- 
tion and addressee name. The deferred execution of 
the analysis by the knowledge base processor is 
available because of the sporadic volume of mail 
pieces submitted to the system. 

While the mail piece is in transit on the truck, 
the knowledge processor completes its analysis and 
is able to transmit by electronic communications link 
to the destination location, the information that the 
mail piece is on its way and the second stage 
routing information needed to automatically sort and 
deliver the mail piece to its corporate addressee. 

In addition, the knowledge base processor ana- 
lyzes the aggregate volume of mail flowing through 
the postal system and transmits to each destination 
location, inventory and resource allocation informa- 
tion necessary to plan for the equipment and man- 
power needed in the following days to sort and 
deliver the mail at each destination location. 
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SYSTEM AND METHOD FOR DEFERRED PROCESSING OF OCR SCANNED MAIL 



This invention relates to the area of automated 
mail processing and more particularly to the use of 
optical character recognition and knowledge base 
systems methods in mail processing to effect cor- 
rection of optical recognition errors, aid an operator 
in disambiguating address misreads, and validating 
the correctness of the address down to the delivery 
sequence. 

In the United States in 1988 approximately 160 
billion pieces of mail were delivered by the United 
States Postal Service. The volume of mail is grow- 
ing at a compound rate of approximately 5% a 
year. To handle such massive volume of mail, 
several methods utilizing automated means have 
been experimented with and installed on a limited 
operational basis. One means of automated mail 
processing utilized today revolves around optical 
character recognition (OCR). The OCR is capable 
of scanning the address area on an envelope and 
interpreting it into machine-readable alphabetic and 
numeric characters. State of the art optical char- 
acter recognition is restricted to machine printed 
addresses and is unusable for handwritten or hand- 
printed characters. Additionally, OCR is prone to 
misread characters and on occasion has difficulty 
in discerning lines in the address block and, when 
there is interference on the face of the envelope, is 
unable to find the address box. When a misread 
occurs, the mail piece cannot be properly sorted 
and either is rejected or an attempt is made to 
enter the correct address read by utilizing the 
directory of the street or city names. Since most 
mail reading OCRs process mail pieces at the rate 
of 600 to 800 per minute, the amount of time in 
which misread correction can be performed limits 
the correction to only the most superficial errors 
and does not allow for validation of the OCR read 
of the address using all constituent information in 
the address box. For example, no attempt is made 
to determine that a given street actually exists 
within a certain zip code and that the city/state 
match the zip code and above all that the ad- 
dressee actually exists at this location. The inability 
to do complete validation and verification on the 
OCR scan has limited the utility of OCRs to mainly 
reading the outgoing city/state/country/destination 
which is normally found on the bottom most line of 
the address box. The other lines of the address, 
which can number an additional five lines and the 
information to sort a letter down to delivery se- 
quence within a building, cannot at this time be 
scanned, OCRed, validated and used for sortation 
down to delivery sequence. 

Without the ability to validate the correctness 
of the OCR interpretation of all lines in the address 



block, the reliability of sortation down to delivery 
sequence drops dramatically. This leads to the 
situation where a major part displaceable cost in 
the mail sortation process results in the handling of 

5 mail after it arrives at its destination post office, 
whereas the reliability of OCR at that point drops 
dramatically to approximately 25% reliability. 

An alternative to the use of OCRs is the 
preprinting of envelopes with a bar code of phos- 

70 phorescent ink encoding that allows machines to 
simply and accurately read address information off 
the envelope without having to do optical character 
recognition. The methods related to pre-printing 
envelopes however, fall short since they are only a 

75 relatively small fraction of the mail volume and 
hence from a logistics standpoint only provide use- 
ful sortation to the destination post office and can- 
not substantiate a large enough volume of mail to 
make it worthwhile to process the mail automati- 

20 cally down to the delivery sequence. 

The invention disclosed herein addresses the 
problem of performing with reliability, mechanical 
separation of mail down to the "delivery sequence" 
utilizing optical character recognition and image 

25 scanning techniques coupled with knowledge 
based operator-assisted disambiguation and valida- 
tion of the address data down to the delivery se- 
quence. The invention also includes a method to 
do this off-line to re-associate the sortation mforma- 

ao tion with the mail piece and optimize mail process- 
ing by utilizing apriori knowledge of the mail dis- 
tribution. 



35 Objects of the Invention 



It is therefore an object of the invention to 
provide an improved technique for processing OCR 
scanned mail. 

40 it is another object of the invention to provide 

an improved technique for the multi-stage process- 
ing of mail. 

It is still a further object of the invention to 
provide an improved technique for analyzing the 
45 aggregate volume of mail flowing through the 
postal system, for the allocation of equipment and 
personnel at apparent destination locations. 

so Summary of the Invention 



These and other objects, features and advan- 
tages of the invention are accomplished by the 
system for deferred processing of OCR scanned 
mail, disclosed herein. The invention is character- 
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ized as a data processing architecture and method 
for multi-stage processing of mail, using knowledge 
based techniques. The system includes OCR-scan- 
ning a multipart address field of a mail piece at a 
sending location, the address field including at 
least two portions, a first stage routing portion 
(destination city, state, country, zip code) and a 
second stage routing portion (destination street ad- 
dress, building floor, corporate addressee internal 
routing). 

At the sending location, the image of the entire 
address field is captured by an OCR head and 
stored in memory. A serial number is printed on 
the mail piece. The first routing portion is then 
converted into sorting signals to sort the mail piece 
to a truck at the sending location which is to be 
dispatched to the city, state and country indicated 
in the first stage routing portion. 

Then, while the mail piece is in transit by truck 
to the destination city, the image of the second 
stage routing portion is analyzed by a knowledge 
base processor to resolve street addresses, build- 
ing floor, corporate addressee internal routing in- 
formation and addressee name. The deferred ex- 
ecution of the analysis by the knowledge base 
processor is available because of the sporadic vol- 
ume of mail pieces submitted to the system. 

While the mail piece is in transit on the truck, 
the knowledge processor completes its analysis 
and is able to transmit by electronic communica- 
tions link to the destination location, the information 
that the mail piece is on its way and the second 
stage routing information needed to automatically 
sort and deliver the mail piece to its corporate 
addressee. 

fn addition, the knowledge base processor ana- 
lyzes the aggregate volume of mail flowing through 
the postal system and transmits to each destination 
location, inventory and resource allocation informa- 
tion necessary to plan for the equipment and man- 
power needed in the following days to sort and 
deliver the mail at each destination location. 



Brief Description of the Drawings 



These and other objects, features and advan- 
tages of the invention will be more fully appre- 
ciated with reference to the accompanying figures. 
Fig. 1 is a system diagram of the invention. 
Fig. 2 is a functional block diagram of the ar- 
chitecture at the sending location, in accordance 
with the invention. 

Fig. 3 illustrates the relationship between the 
address block on the physical mail piece and 
the captured image of the address block and the 
resolved alphanumeric address data. 
Fig. 4 illustrates a generalized format for the 



mail piece electronic folder. 
Fig. 5 is an architectural block diagram of the 
receiving location, in accordance with the inven- 
tion. 

s Fig. 6 is a system block diagram illustrating the 
relationship between the sending location, the 
receiving location, and the off-line or remote 
processing system. 

Fig. 7 illustrates sample operator assists at a 
10 workstation, in accordance with the invention. 

Fig. 8 illustrates the normal case where the 
address block is processed off-line and all fields 
are successfully verified with no operator inter- 
vention. 

75 Fig. 9 illustrates an operator display where a 
misoriented address block is located and des- 
ignated by the operator and then an OCR read- 
ing of the manually located address block is 
performed. 

20 Fig. 10 illustrates the operator display for case 3 
where there has been an OCR misread. 
Fig. 11, consisting of Figs. 11 A, 11Band 11C, is 
a process flow diagram illustrating the method of 
the invention as carried out at the sending loca- 

25 tion. 

Fig. 12, consisting of Figs. 12A and 12B, is a 
process flow diagram illustrating the method of 
the invention at the off-line or remote processing 
location. 

30 Fig. 13 is a process flow diagram illustrating the 

method of the invention at the receiving location. 

Fig. 14 is an architectural diagram of the off-line 

or remote processing system 14. 

Fig. 15 illustrates an example of the street 
35 name/city data base and the street number/zip. 

data base in the memory 19" of the off-line or 

remote processing system 14. 

40 Description of a Mode for Carrying Out the Inven- 
tion 

The invention is directed to automated mail 
handling and focuses on providing a highly reliable, 

45 generally implemented methodology that supports 
mechanical separation of mail down to the 
"delivery sequence." This implies that when the 
automated phases of mail piece processing are 
completed, the letters are in their carrier delivery 

so sequence down to suite/apartment within a build- 
ing. To accomplish this, reliable, consistent analy- 
sis of up to five lines of address information is 
required. Current state of the art using optical char- 
acter recognition (OCR) analyzes only one line. 

55 Nearly one quarter of postal labor is currently in- 
volved in manual sortation of mail down delivery 
sequence. 

The invention applies the principles of "Just In 
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Time Manufacturing" (JITM) to automate mail pro- 
cessing. The general steps of the process are as 
follows: 

1. Mail pieces are scanned by a state of the art 
optical character recognition (OCR) which re- 
solves the city/state/zip line of the address. This 
is sufficient to route the letter to a district deliv- 
ery post office. Each mail piece is then "bar 
coded" with an ID number and dispatched to its 
delivery post office. 

2. The image scan of the address block (which 
contains the remaining 3-4 lines of address in- 
formation) is captured, compressed to about one 
kilobyte (KB) and stored on a disk. 

3. While the physical mail pieces are in route to 
their delivery post offices, (i.e., via truck, train, or 
plane, or on a dolly within the same post office) 
their respective address block images are pro- 
cessed off-line in specially configured worksta- 
tions located either at the sending location or at 
a remote location. 

4. In the workstation, each address block image 
is processed to resolve sortation down to deliv- 
ery sequence as follows: 

a. Off-line OCR is performed on the remain- 
ing 3-4 address lines of the address block in 
either a workstation or in a LAN server pro- 
cessor. 

b. The address data is reviewed against a 
Post Knowledge Base. If there are no appar- 
ent OCR misreads, then the system will: 

- Validate and cross check all the address 
fields including the recipient. 

- Resolve any address ambiguities such as 
incomplete address. 

- Derive the delivery sequence within a build- 
ing. 

c. If OCR misreads are encountered, then the 
system will: 

- Perform OCR misread correction using al- 
gorithms for spelling correction. The correc- 
tion candidate information is displayed to the 
workstation operator along with the original 
image. The operator makes the final correc- 
tion decision. This provides advantages over 
the operator re-keying the address correction, 
including economy of keystrokes and avoid- 
ance of operator errors. - Knowledge based 
disambiguation of incomplete address data. 

5. At the completion of Step 4, all the address 
data required to machine sort the mail piece 
down to delivery sequence will have been re- 
solved. The information is then batched by des- 
tination post office and transmitted via a high 
speed telecommunications network to the re- 
spective destination post offices. 

6. The address information is re-associated with 
each physical mail piece when it arrives at the 



destination post office via the ID number pre- 
viously bar coded on the envelope. The physical 
mail piece and its sortation information hence 
"come together" in Just In Time Manufacturing 
5 fashion and the sorting is completed down to 
delivery sequence. 
Operational refinements that can be overlaid on 
the basic invention are: 

1. The order in which mail pieces are processed 
w at the workstation can be prioritized based on 

destination post office distance (travel time). 
Those mail pieces with the longest overland 
travel time can be assigned the lowest priority, 
since there is more time to process their ad- 
15 dress image. Those mail pieces which will be 
"turned around" in the same post office will be 
given the highest priority. 

2. Handwritten envelopes can be detected as 
non-machine readable and directed to special 

20 operators rated for re-keying skills (i.e. since no 

OCR is possible on handwriting). The re-keying 
is automatically terminated as soon as enough 
information has been entered to complete the 
sortation to delivery sequence. Termination de- 

25 cisions can be made on a word-by-word basis. 

3. Mail pieces that were rejected by the OCR 
because the address block could not be found, 
are bar coded with an ID and their image dis- 
played at an operator's workstation. Using a 

go mouse, the operator can confirm the location of 
the address box. A video sensing algorithm al- 
lows the perimeter of the address box to be 
automatically calculated once the cursor has 
been placed on any part of the address box. 

35 Operator assisted OCR processing and knowl- 
edge base disambiguation can be used to start 
the mail piece on its way and complete the 
JITM sorting at the delivery post office per 
Steps 5 and 6 above. 

40 The apparent operational benefits to the postal 

system are: 

1. Better use of equipment, people and optimiz- 
ation of sort allocation by knowing the "exact" 
distribution of incoming mail before the start of 

45 processing. 

2. Improved theft security by reducing the num- 
ber of human handling steps. 

3. Lower peak mail processing requirements by 
extending the processing window by the length 

so of the transit times. 

4. Ability to become a major player in "moving 
work to people" since the workstations do not 
need to be co-located with the sorting. 

5. The proposed automation process uses gen- 
55 eral purpose hardware for workstations and uti- 
lizes the postal system's current investment in 
on-line OCRs. 

6. The technology provides an efficient, auto- 
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mated process for both printed and handwritten 
mail pieces. This is a prerequisite for a global 
solution and to achieving economies of scale 
that justify the automation. 
Turning now to the figures, Fig. 1 is an overall 
system diagram of the invention and Fig. 2 is an 
architectural diagram of the sending location 10. 
Mail pieces which originate at the sending location 
10 are read through the optical character recogni- 
tion machine (OCR) 20. Fig. 3 illustrates a physical 
mail piece 22 which has a destination address 
block 45 which includes the city/state/zip address 
data 30 and the addressee, street name and street 
number data 32. The OCR 20 scans the physical 
mail piece 22 and captures the image 45 of the 
address block as a two-dimensional array of picture 
elements in a bit plane. The captured image 45 
includes the image 30' of the city/state/zip informa- 
tion 30 and it further contains the image 32 of the 
addressee and street name and street number 32. 
The OCR 20 attempts to resolve the image 30' of 
the city/state/zip information 30 into an alphanu- 
meric character string of resolved address data 42. 
In accordance with the invention, the system defers 
the resolution of the image 32 of the addressee, 
street name and street number information 32 until 
later. 

As is seen in Fig. 2, a locally originated mail 
piece is input to the conveyor 12 and passes 
beneath the OCR 20 where it is scanned. The mail 
piece then continues on the conveyor belt and the 
bar code printer 21 prints a serial number 24 onto 
the mail piece 22. In its normal operation, the OCR 
20 will read the second portion 30 of the address 
block 45 consisting of the city, state, country and 
zip code destination, and will enter this into the 
resolved address data block 40 in the memory 19 
of Fig. 2. The data processing system of Fig. 2 
includes the CPU 23 which is connected by means 
of the bus 11 to the memory 19, the OCR 20 and 
the bar code printer 21. The system of Fig. 2 
further includes the workstations 31, the bar code 
reader 37, the sorting machine 33 connected by 
the connection 35, the mass store 25 and the 
communications adapter 27 all interconnected by 
the system bus 11. The communications adapter 
27 communicates over the communications link 29 
to the receiving location 28 and the off-line or 
remote processing system 14, as is shown in Fig. 
6. 

The resolved address data block 40 shown in 
Fig. 2 has two portions, the first portion 42 stores 
the resolved alphanumeric string for the city, state, 
zip code or country as was recognized by the OCR 
20 in its scanning operation. The second portion 44 
of the resolved address data block will contain the 
resolved addressee and street name and street 
number information which will eventually be output 



during the course of the operation of the invention. 

The resolved city, state, zip code and/or coun- 
try information in portion 42 of the resolved ad- 
dress data block 40 is output to the sorting ma- 

5 chine 33 and is used to physically sort the mail 
piece 22 into an appropriate pocket in the sorting 
machine. The physical pocket in the sorting ma- 
chine 33 is associated with a particular mode of 
transportation, whether by airplane, truck, train or 

10 other mail transportation medium, which is destined 
to the city and state and country named in the 
destination address block 45. 

As the mail piece 22 passes out of the OCR 
20, the bar code printer 21 prints a bar code 24 

75 representing an identification number 24' which will 
allow the mail piece 22 to be re-associated with the 
information in the resolved address data block 40. 
That re-association, in one embodiment of the in- 
vention, is made at the receiving location 28 for the 

20 mail piece, where the resolved addressee, street 
name and street number information 44 can be 
associated with a particular mail piece 22 by the 
identity of the identification number 24 . In an al- 
ternate embodiment of the invention, where the 

25 sorting machine 33 is not electronically connected 
by the link 35 to the OCR 20 and the CPU 23, the 
bar code 24 can also be used by the bar code 
reader 37 to enable the accessing of the appro- 
priate city/state/zip information to control the sor- 

30 ting machine 33. 

When the mail piece 22 is scanned by the 
OCR 20, the captured image 45 is stored as a 
two-dimensional bit plane of picture elements in the 
mass storage 25, which can be for example a large 

35 capacity magnetic DASD. The image 45 is stored 
in conjunction with its identification number 24 as 
the image data block 17 in Fig. 2, and is accessible 
by its identification number. That same identifica- 
tion number 24 is also another portion of the 

40 resolved address data block 40, to facilitate acces- 
sing thereof. Still further, that identification number 
24 is imprinted by the bar code printer 21 as the 
bar code 24 onto the face of the mail piece 22. 
If the image 30 of the captured image 45 of 

45 the address block is successfully resolved, then the 
city, state, zip and country information can be 
output by the OCR 20 in conjunction with the CPU 
23 to the sort machine 33 to physically sort the 
mail piece. 

so If instead, the image 30' of the captured image 

45 containing the city, state, zip and/or country 
information is not successfully recognized, then the 
CPU 23 directs the conveyor 12 to send the mail 
piece 22 to the reject tray 18, the mail piece 22 still 

55 having the bar code 24 imprinted thereon with the 
identification number. Thereafter, by further data 
processing analysis and/or by additional operator 
intervention and interpretation, the unresolved por- 
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tions of the city/state/zip/country codes in the im- 
age 30 can be determined and input to the portion 
42 of the resolved data block 40. Thereafter, the 
reject mail in the temporary holding tray 18 can be 
fed into the bar code reader 37 associated with a 
sorting machine 33, for the identification number 
for each mail piece is read. That number is then 
associated by the CPU 23 with the corresponding 
resolved address data block 40 and the information 
in the portion 42 can be accessed to control the 
sort machine 33. The sort machine 33 can then 
properly sort the mail piece 22 into the appropriate 
transport 26. 

After the first sorting operation at the sending 
location 10, the mail piece 22 is physically loaded 
onto a carrier 26 such as a truck, airplane or other 
appropriate transportation medium, and is phys- 
ically transported to the postal destination 28. 

Where the portion 42 of the address data block 
40 gives the resolved city and state information 
indicating that the mail piece is addressed to a 
recipient located at the same sending location, then 
use is made of the resolved addressee and street 
number information in portion 44 of the address 
data block 40. If the portion 42 of the resolved 
address data block 40 indicates that the city, state, 
zip code is that of the sending location, then the 
mail piece is characterized as turnaround mail and 
will typically be processed with a higher priority 
than remote destination mail. The local mail is 
preferentially processed by performing the resolu- 
tion of the addressee and street name and street 
number information image 32' of a captured image 
45 , the reduction of this information being per- 
formed prior to those operations for remotely des- 
tined mail pieces. In a similar manner, prioritization 
based upon the estimated travel time for remotely 
destined mail pieces can be performed, giving a 
lower priority to those mail pieces whose physical 
transport duration is longer. 

Each captured image 45' of the address block 
stored on the mass storage device 25 is processed 
off-line to resolve the addressee and the street 
name and street number information image 32'. 
This information, once resolved, will then be en- 
tered as alphanumeric data into the portion 44 of 
the resolved address data block 40. This operation 
is carried out by the CPU 23 using character rec- 
ognition algorithms and knowledge base verification 
information, in accordance with the invention. Since 
this processing can be deferred from the initial 
OCR scan of the mail piece, this process of inter- 
pretation of the addressee information image 32' 
can be performed in a remote processing facility 
such as the remote processing system 14 shown in 
Fig. 6. Workstations 31 " at the system 14 can be 
used for any needed operator intervention in the 
interpretation of the images 32'. 



Once the addressee and street name and 
street number information is converted into an al- 
phanumeric string in portion 44 of the address data 
block 40, the resolved address data block 40 can 
5 be transmitted through the communications link 
adaptor 27 and over the communications link 29 to 
the destination location 28. This is achieved by 
assembling the mail piece electronic folder 16 as 
shown in Fig. 4 which is a message data block 
10 which includes the serial number 24', the 
city/state/zip alphanumeric information 42 , the ad- 
dressee, street name and street number alphanu- 
meric information 44, and optionally, the captured 
image 45 of the address block, in the form of the 
15 bit plane of picture elements. In those instances 
where the addressee, street name and street num- 
ber image 32 have not yet been resolved, then the 
portion 44 of the mail piece electronic folder 16 will 
be empty and it will be necessary to to include the 
20 captured image 45 in the mail piece electronic 
folder 16, when it is transmitted to either the off- 
line or remote processing system 14 or alternately 
to the receiving location 28, where the addressee, 
street name and street number image 32' can be 
25 resolved. 

At the destination location 28, the resolved 
address data block 40 will have its information 
used for providing the addressee and street name 
and street number information to enable routing the 
30 mail piece at the destination location 28. Fig. 5 
shows an architectural diagram of the receiving 
location 28, where the transport 26 delivers the 
mail piece 22 onto the conveyor 12'. The mail 
piece 22 has its bar code 24 read by the bar code 
35 reader 37 and that serial number is then asso- 
ciated by the CPU 23' with the address data block 
40 which has been received over the communica- 
tions link 29 by the communications adaptor 27 . 
The addressee, street name and street number 
40 information 44 in the received address data block 
40, is then applied by the CPU 23' to the sort 
machine 33 to perform the sortation of the mail 
piece 22 down to the delivery sequence. The sort- 
ed mail piece 22 can then be locally delivered at 
45 the receiving location 28 to the addressee at his 
particular street and street number. The portion 44 
of the address data block 40 can also include an 
addressee building floor number. The CPU 23' can 
control the sorting of the mail to appropriate local 
so mail routes, in a street name order and address 
number order and a building floor order, if appro- 
priate. 

Since the resolved address data block 40 can 
be transmitted over the communications link 29 at 
55 an earlier time than the expected time of arrival of 
the physical mail piece 22 at the receiving location 
28, the information contained in the resolved ad- 
dress data block 40 can be used at the receiving 
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location 28 to allocate resources at the destination 
location. Resource allocation information can be 
computed by the CPU 23' and output at a display 
and/or printer or at the workstations 31 . Where an 
address data block is misread or cannot be read 
by the OCR 20 at the sending location 10, a 
sequence of operator intervention steps and/or al- 
gorithmic interpretation steps can be carried out 
As is shown in Fig. 8, the scanned image 45 of the 
address block can be displayed at an operator 
workstation 31 at either the sending location 10, the 
off-line or remote processing location 14, or at the 
receiving location 28, and the alphanumeric char- 
acter string 54 resulting from the OCR recognition 
operation can also be displayed. Fig. 7 illustrates 
examples of operator assists by the workstation 31 , 
31 ' or 31 " to enable spelling aid and validation of 
company names, zones, address systems, and 
miscellaneous information to assist the operator in 
making a decision about how to correct an OCR 
misread alphanumeric string 54. Fig. 9 illustrates 
the case where the address block 45 on the mail 
piece 22 is not properly aligned and the operator 
can use the mouse 46 to designate the portion of 
the image 45 representing the destination address. 
The workstation 31 can then automatically compute 
the correct alphanumeric character string 54. In 
Fig. 10, a third case is shown where the address 
block 45 has a poor image 45 which is misread by 
the OCR as the alphanumeric string 54. The work- 
station 31, 31 or 31 can perform a data base 
lookup of street names, for example, whose spel- 
lings most closely approximate the alphanumeric 
character string 54. A first alternative 56A and a 
second alternative 56B are then presented to the 
operator who can then select the appropriate alter- 
native spelling, which is then inserted as the se- 
lected street name into the portion 44 of the ad- 
dress data block 40. 

Fig. 11, consisting of Figs. 11 A, 11 B and 11C, 
is a sequence of method steps performed at the 
sending location 10 to process an incoming mail 
piece in the system shown in Fig. 2. At step 60, a 
mail piece is input to the conveyor 12 and an ID 
number is assigned to the mail piece and an ad- 
dress data block is created. Then in step 62, the 
mail piece is scanned by the OCR 20 and the 
image 45' is captured and stored in the image data 
block 17 of the memory 19. Then in step 64, the 
bar code printer 21 prints the ID number 24' on the 
mail piece 22. The image 45' of the mail piece is 
buffered in the memory 19 and optionally in the 
mass store 25. The CPU 23 then in step 66, 
attempts to locate the address block in the cap- 
tured image 45'. In step 68, if the address block is 
located, then the process passes to step 72. How- 
ever, if the address block has not been located, 
then at step 70, an operator at one of the work- 



stations 31 will assist in locating the address block, 
as was shown for case 2 in Fig. 9. In step 72, an 
attempt is made to resolve the image portion 30 
for the city/state and zip code. In step 74, if the 

5 image 30 is resolved, then the process passes to 
step 76 where optional editing of the image data 
can be performed and then the resolved alphanu- 
meric string for the city/state and zip are buffered 
in portion 42 of the address data block 40 and are 

10 also output to the mail sorting machine 33 to sort 
the mail piece on the conveyor 12. At step 74 if the 
city/state/zip code image 30' is not resolved, then 
the process flows to step 82 where the mail piece 
22 is stored in the temporary holding tray 18. Then, 

75 one of the operators at the workstations 31 will 
perform an operator assist to resolve the 
city/state/zip code information in step 84. This in- 
formation is then stored in the portion 42 of the 
address data block 40. In step 86, the mail piece 

20 exits the holding tray 18 and the bar code reader 
reads the ID number for the mail piece and uses 
the ID number in step 88 as the address for acces- 
sing the city/state/zip information from portion 42 of 
the address data block 40 in the memory 19 and 

25 this information is then output in step 90 to the mail 
piece sorting machine 33 to sort the mail piece on 
the conveyor belt 12. Then in step 80, the sorted 
mail piece is transferred from the conveyor belt 12 
to the transport 26 for physical transportation to the 

30 destination location 28. Then the process flows to 
step 92 where the mail piece electronic folder 16 is 
assembled as shown in Fig. 4, and this telecom- 
munications message is then output by the com- 
munications adaptor 27 on the communications link 

35 29 to either the off-line/remote processing system 
14 or to the receiving location 28, where the image 
32' of the addressee, street name and street num- 
ber can be resolved into alphanumeric strings. Al- 
ternately, the resolution of the image 32' for the 

40 addressee, street name and street number can be 
performed at the sending location 10 by the CPU 
23. 

For the example where the resolution of the 
image 32' of the addressee, street name and street 

45 number information is to be performed at the off- 
line or remote processing location 14, Fig. 12 illus- 
trates the sequence of operational steps for per- 
forming that resolution. In Fig. 12, step 94 receives 
and stores the mail piece electronic folder 16. Then 

so in step 96, a second pass of the stored image of 
45' is performed for character recognition of the 
image 32' of the addressee, street name and street 
number information. In step 98, a validation test is 
performed to determine if the street address which 

55 is resolved in step 96 is a street address which 
exists in the city information which was resolved in 
step 72. This can be performed by a data base 
comparison, using a data base containing all of the 
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valid street names for each of a plurality of cities. If 
the validation test is not passed, then an operator 
assist can be provided to interpret and correct 
either the resolved street address information or 
city information. Then in step 100, a validation test 
can be performed to determine if the street number 
range resolved in step 96 and the street match the 
zip code resolved in step 72. If the validation test is 
not passed, then an operator assist step can be 
performed. Then in step 102, a validation can be 
performed to determine if the addressee informa- 
tion resolved in step 96 corresponds to an ad- 
dressee name which is shown to exist at the street 
address which was resolved in step 96. If the 
validation test is not passed, then an operator as- 
sist step can be performed. Then in step 104, the 
mail piece electronic folder 16 can have its portion 
44 augmented with the additional resolved informa- 
tion for the addressee, street name and street 
number information which was resolved in step 96 
and which was validated in steps 98, 100 and 102. 
Then in step 106, the mail piece electronic folders 
16 can be sorted and batched by the identity of the 
receiving location 28. Then in step 108, statistics 
can be compiled as to the volume of mail which is 
directed to each respective receiving location 28. 
This information can be used at each respective 
receiving location 28 to allocate resources neces- 
sary to handle the physical mail which is now en 
route on the transport 26. Then in step 110, the 
sorted and batched mail piece electronic folder 16 
can be transmitted over the communications link 
29 to the respective receiving locations 28. 

The data processing architecture for the off-line 
or remote processing system 14 is shown in more 
detail in Fig. 14. The system bus 11 interconnects 
the memory 19", the CPU 23", the mass store 25" 
and the workstations 31 , to the communications 
adaptor 27 which is connected to the communica- 
tions link 29 from the sending location 10. The mail 
piece electronic folder 16 in Fig. 4 is received over 
the communications link 29 by the communications 
adaptor 27 and the data therein is stored in the 
memory 19 in the address data block 40 and the 
image data block 17, addressable by the corre- 
sponding serial number 24 . If the addressee in- 
formation and/or the street name and street num- 
ber information has not yet been resolved at the 
sending location 10, then the off-line/remote pro- 
cessing system 14 will carry out the resolution of 
this information, to insert alphanumeric strings re- 
presenting this information into the portion 44 of 
the address data block 40. 

The OCR program 132 in memory 19', will 
analyze the image portion 32' of the captured 
image 45 stored in the image data block 17 and 
will produce an alphanumeric string representing 
the street name. Then, step 98 in the process flow 



diagram of Fig. 12 will perform a validation test to 
determine if this street name exists in the city 
whose name has been resolved and is currently 
stored in the portion 42 of the address data block 

5 40. The knowledge base program 134 performs a 
check of the street name/city data base 98' which 
is shown in greater detail in Fig. 15. If the city 
name for example is "Springfield" in portion 42 of 
the address data block 40, and the OCR program 

iq 132 has output the alphanumeric string "Arbor Dr." 
for the street name, then the data base 98 is 
checked to validate that this street name actually 
occurs in the city of Springfield. If it occurs, then 
the process flow diagram of Fig. 12 passes to the 

75 next step. However, if there is no corresponding 
street name match, then an initial check is made to 
determine if there is a simple spelling error in the 
alphanumeric string output by the OCR. This is 
performed by the spell correction program 130 in 

20 the memory 19 , which can be for example the 
spelling correction program described in USP 
4,328,561 by Rosenbaum, et al. entitled "Alpha 
Content Match Prescan Method for Automatic 
Spelling Error Correction," which is assigned to the 

25 IBM Corporation and incorporated herein by refer- 
ence. For example, if the spelling of the alphanu- 
meric string output by the OCR program 132 for 
the street address is "Arbar," the spelling correc- 
tion program 130 will be able to identify that there 

30 is a close match between the misspelled "Arbar" 
and the data base occurrence of the name "Arbor." 
The knowledge base program 134 can then 
present to the operator at the workstation 31 , the 
misspelled alphanumeric character string output 

35 from the OCR program 132, and the suggested 
correct spelling for the street name output by the 
spell correction program 130, and the operator at 
the workstation 31 can indicate which spelling is to 
be selected for insertion in portion 44 of the ad- 

40 dress data block 40 in memory 19 . 

The process flow diagram of Fig. 12 continues 
to step 100 where a validation is performed on the 
number range of the street number which is output 
from the OCR program 132 in its attempt to resolve 

45 the street number portion of the image 32 of the 
captured image 45 stored in the image data block 
17. This validation is performed by the knowledge 
base program 134, which makes use of the street 
number/zip data base 100 which is shown in great- 

50 er detail in Fig. 15. If the zip code value resolved in 
portion 42 of the address data block 40 is "99110," 
and if the validation step 98 for the street name has 
resulted in a resolved street name of "Arbor Dr." 
which is now stored in portion 44 of the address 

55 data block 40, and if the OCR program 132 has 
output a suggested street number of "500," then 
the knowledge base program 134 accesses the 
street number/zip data base 100' to determine if 



9 



15 



EP 0 424 728 A2 16 



the string of "500" suggested by the OCR program 
is a valid number within the address range for the 
zip code value "99110." As can be seen by refer- 
ence to the street number/zip data base 100 in 
Fig. 15, the OCR suggested value of "500" is 
within the range for the zip code value "99110," 
and therefore the suggested string "500" output by 
the OCR program 132 is stored in portion 44 of the 
address data block 40. If the validation test for the 
street number had not been passed, then the OCR 
suggested value of "500" would have been pre- 
sented to the operator at the workstation 31 " along 
with the captured image 45 , so that the operator 
could key in a correct street number value which 
would then be stored in portion 44 of the address 
data block 40. 

In a similar manner, the process flow diagram 
of Fig. 12 will pass to step 102 where a validation 
is performed as to whether the addressee name 
string output by the OCR program 132 in its analy- 
sis of the image data block 17, gives the name of 
an addressee which does exist at the street ad- 
dress name and street address number which have 
been resolved and are now stored as alphanumeric 
strings in the address data block 40. The knowl- 
edge base program 134 will make use of the 
addressee/address data base 102' in the memory 
19 to make this determination in a manner similar 
to the analysis performed for the validation step 98. 
Additional information fields in the captured image 
45 can also be analyzed by the OCR program 
132, such as building floor, corporate name, and 
other address information, as appropriate. 

After all of the fields in the portion 32' of the 
captured image 45 in the image data block 17 
which are capable of resolution, have been re- 
solved into alphanumeric character strings and 
have been stored in the address data block 40, the 
mail piece electronic folder 16 is completed with 
the additional resolved alphanumeric data which is 
added to portion 44 in Fig. 4, and then the mail 
piece electronic folder 16 is transmitted by the 
communications adaptor 27 over the communica- 
tions link 29 to the receiving location 28. If all of 
the captured image 45 has been resolved, then it 
is optional whether the image data block 17 in- 
formation needs to be transmitted on to the receiv- 
ing location 28. 

Fig. 13 shows a flow diagram of the sequence 
of operational steps to perform the invention at the 
receiving location 28. In step 112, the mail piece 
electronic folder 16 is received over the commu- 
nications link 29 by the communications adaptor 
27 in Fig. 5. In step 114, the transport 26 delivers 
the physical mail pieces 22 which are input to the 
conveyor belt 12'. In step 116, the mail piece 22 
has its bar code 24 read by the bar code reader 
37 . The bar code ID read in step 118, is applied in 



step 120 to access the addressee, street name and 
street number information from the address data 
block 40 which is now stored in the memory 19', 
after having been received by the communications 

5 adaptor 27 . This addressee, street name and 
street number information is then output by the 
CPU 23' to the sort machine 33' to sort the mail 
piece 22 on the conveyor 12' so that sortation can 
be performed down to the delivery sequence. The 

io sorting steps in step 122 and 124 are resolved in 
the sorting of the mail piece to an appropriate local 
mail route, in a street name order and address 
number order and in a building floor order, if ap- 
propriate. 

75 A sortation program 140 and a resource alloca- 

tion program 142 are present in the memory 19' at 
the receiving location 28 in Fig. 5, to carry out the 
sortation of the mail pieces down to the delivery 
sequence and to carry out the provision of re- 

20 source allocation information to enable local postal 
management to have advance warning of a need 
for additional resources to handle the physical mail 
pieces to be delivered to the receiving location. 
Although a specific embodiment of the inven- 

25 tion has been disclosed, it will be understood by 
those having skill in the art that changes can be 
made to that specific embodiment without depart- 
ing from the spirit and the scope of the invention. 

30 

Claims 

1. A method for deferred processing of a mart 
piece (22) having a destination address block in- 

35 eluding a first routing portion (30) designating at 
least a destination location and a second routing 
portion (32) designating at least an addressee, 
comprising the steps of: 

capturing an image (45) of said destination address 
40 block at a sending location (10); 

analyzing said image of said destination address 
block to generate a first signal representing said 
destination location; 

sorting said mail piece at said sending location in 
45 response to said first signal for transport (26) to 
said destination location; 

transporting (26) said mail piece to said destination 
location; 

analyzing (14) said image to generate a second 
so signal representing said addressee; 

transmitting (29) said second signal to said destina- 
tion location; 

receiving said mail piece at said destination loca- 
tion and sorting said mail piece in response to said 
55 second signal for delivery to said addressee. 

2. Method of claim 1, which further comprises the 
step of: 

computing postal resource allocation information 

10 
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for said destination location in response to said first 
signal. 

3. Method of claim 1 or 2, which further comprises 
the step of: 

determining, during said step of analyzing to gen- 
erate said second signal, whether said addressee, 
as represented by said second signal, is located at 
said destination location; 

presenting said image of said destination address 
block to an operator for interpretation of addressee 
information therein, if said addressee, as repre- 
sented by said second signal, is not located at said 
destination location. 

4. Method of claim 1 , 2, or 3, in which said destina- 
tion location includes a first stage routing portion, 
and/or a state, and/or a country and/or a zip code. 

5. Method as in anyone of the preceding claims, in 
which said second routing portion designating at 
least an addressee may contain in addition to the 
addressee the name of the addressee and/or the 
destination street address, and/or building floor, 
and/or addressee internal routing. 

6. Method as in anyone of the preceding claims, in 
which the capturing of an image of said destination 
address block at the sending location is performed 
by scanning a multipart destination address field of 
said mail piece by an optical character recognition 
device (20), and storing the entire address field in a 
memory (19). 

7. Method as in anyone of the preceding claims, 
further containing the step of printing at the send- 
ing location a serial number (24), especially a bar 
code as an indicium representing the identity of 
said mail piece (22) on said mail piece. 

8. Method as in anyone of the preceding claims, in 
which the step of analyzing, be it analyzing of the 
image of the destination address block or the im- 
age to generate a second signal representing the 
addressee, is performed by a knowledge base pro- 
cessor to resolve information, such as destination 
location, destination city, state, country, zip code 
and/or street addresses, building floor, corporate 
addressee internal routing information and address- 
ee name. 

9. A system for deferred processing of a mail piece 
(22) having a destination address block (30, 32) 
including a first routing portion (30) designating at 
least a destination location and a second routing 
portion designating at least an addressee, compris- 
ing: 

means (20) for capturing an image (45) of said 
destination address block at a sending location 
(10); 

means (23) coupled to said capturing means, for 
analyzing said image of said destination address 
block to generate a first signal (24, 42) represent- 
ing said destination location; 

means (33) coupled to said analyzing means, for 



sorting said mail piece at said sending location in 
response to said first signal (24, 42) for transport to 
said destination location; 

means (27, 29) coupled to said capturing means, 

5 for transmitting said image of said destination ad- 
dress block to a second location (28); 
means (23 ) coupled to said transmitting means, for 
analyzing said image at said second location to 
generate a second signal representing said ad- 

io dressee; 

means (37 , 33 ) coupled to said analyzing means 
at said second location, for transmitting said sec- 
ond signal to said destination location; 
means (26) for transporting said mail piece from 

15 said sending location to said destination location; 
means coupled to said transmitting means at said 
second location, for receiving said mail piece at 
said destination location and sorting said mail piece 
in response to said second signal for delivery to 

20 said addressee. 

10. The system of claim 9, which further com- 
prises: 

means (19 , 140, 142) coupled to the first said 
analyzing means, for computing postal resource 
25 allocation information for said destination location in 
response to said first signal. 

11. The system of claim 9 or 10, which further 
comprises: 

means in said analyzing means at said second 
30 location, for determining whether said addressee, 
as represented by said second signal, is located at 
said destination location; 

means at said second location, for presenting said 
image of said destination address block to an oper- 
35 ator for interpretation of addressee information 
therein, if said addressee, as represented by said 
second signal, is not located at said destination 
location. 

40 
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FIG. 11 A 
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FIG.11C 
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© The invention is characterized as a data pro- 
cessing architecture and method for multi-stage pro- 
cessing of mail, using knowledge based techniques. 
The system includes OCR-scanning a multipart ad- 
dress field of a mail piece at a sending location, the 
address field including at least two portions, a first 
stage routing portion (destination city, state, country, 
zip code) and a second stage routing portion 
(destination street address, building floor, corporate 
addressee internal routing), 

At the sending location, the image of the entire 
address field is captured by an OCR head and 
stored in memory A serial number is printed on the 
mail piece. The first routing portion is then converted 
into sorting signals to sort the mail piece to a truck 
at the sending location. 

While the mail piece is in transit on the truck, 
the knowledge processor completes its analysis and 
is able to transmit by electronic communications link 
to the destination location, the information that the 
mail piece is on its way and the second stage 
routing information needed to automatically sort and 
deliver the mail piece to its corporate addressee. 
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